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Abstract 

Symmetries of the auto-cumulant function (the generahzation of the auto-covariance 
function) of a fc^'^-order stationary time series are derived through a connection with 
the symmetric group of degree k. Using theory of group representations, symmetries of 
the auto-cumulant function are demystified and lag-window functions are symmetrized 
to satisfy these symmetries. A generalized Gabr-Rao optimal kernel, used to estimate 
general fc^'^-order spectra, is also derived through the developed theory. 

Keywords: higher-order spectra, group representations, multivariate lag-windows, 
symmetry group 



1 Introduction 

There have been numerous approaches and references to constructing multivariate lag- 
windows that inherit the appropriate symmetries when estimating fc*^-order spectra; cf. 
0; 10) U) l3) 15, 17, 18, 19, [iy]. However, each approach constructs multivariate lag- 



windows from univariate lag-windows which severely limits their shape. The case k = 2, 
corresponding to univariate lag-windows, is trivial since the only symmetry requirement 
imposed is evenness. The symmetries are much more complex in the case A; = 3, and the 
challenge is to fully understand these symmetries and to easily construct lag-windows 
that inherit the symmetries. In the process, an intimate relationship is discovered 
between the symmetries of the bivariate lag- window and the symmetric group Ss- 
Here we begin to see the makings of a beautiful connection between stationary time 
series and group representations of the symmetric group. 

General theory of the symmetric group is pulled back via this connection to the 
context of stationary time series and many useful results are ascertained. Similarly, 
natural constructions of stationary time series are pushed forward into the theory of 
group representations of Sk to elicit a new realization of a familiar representation of 
Sk with dimension k — 1. 
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Current practices of generating multivariate lag-windows are shown to be inade- 
quate as they restrict the construction of many useful lag- windows including the Gabr- 



Rao optimal bivariate lag- window 171], yet this lag- window occurs naturally under 
the symmetrization routine described in this paper. The connection to the symmetric 
group via group representations allows for effortless generalization of the Gabr-Rao 
lag-window for estimation of the fc*^-order spectral density. 

This is not the first time a connection between group representations and statistics 
has been drawn. In fact an entire book Group Representations in Probability and 
Statistics [H], written by the mathemagician Persi Diaconis, exposes many connections 
between group representations and statistics, but the connections drawn in this book 
are much different, with different motivations, from what is done the present article. 

The first section defines stationarity — the driving force in the connection with the 
symmetric group — and introduces higher-order spectral densities and spectral density 
estimates with lag-windows. To solidify ideas, the case = 3 is considered in Section 
2, and a connection between symmetries of the auto-cumulant function (acf) and per- 
mutations of the symmetric group is obtained via a special group representation on 5*3; 
these results are generalized for general k in Section 4. Section 5 describes the actions 
in the Fourier domain viewpoint which allows for the generalization of the Gabr-Rao 
kernel. Finally, several examples of bivariate kernels and lag-windows are produced in 
Section 6. 



2 Stationarity 

Let {Xt} be a fc^'^-order discrete (resp. continuous) stationary time series, that is 



E 



ues 



(1) 



is finite and the same for every t G Z (resp. t G M) where 5 C Z (resp. 5* C M) is any 
set of size at most k — 1. A stronger, yet more typical, assumption is to require the time 
series to be strictly stationary, that is for any positive integer n and any t,xi, . . . ,Xn m 
Z (resp. M), the random vectors {Xx^^, ■ ■ ■ ,Xx„) and {Xt+xi, ■ ■ ■ ,Xt+x„) have identical 
joint distributions. However, this stronger version of stationarity will not be needed 
here since this paper only deals with expressions like those in ([T|). 

As far as this paper is concerned, the theory for discrete time series is the same as 
that for continuous time series, so the remainder of this paper will focus on discrete 
time series only. 

Let's specialize for the moment to the case of second-order (aka weakly) stationary 
time series. The assumption of second-order stationarity requires 

(i) E{Xf) < 00 for all t; 

(ii) E{Xt) = fi for all t; 

(iii) C{x) = E[{Xt - fi){Xt+x - /i)] for all x and t. 

The function C{x) is called the auto-covariance function (referred to within as the 
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auto-cumulant function), and by replacing x with — x, we see that 

C{-x) = E[(Xt - - ^Ji)] = n{Xt' - fi){Xt'+^ - /u)] = C{x) 

(using t' = t+x); so C{x) is an even function, and this is the only symmetry requirement 
of C(x). 

Third-order stationarity is implied by the conditions (i)-(iii) above plus following 
two conditions: 

(iv) E{Xt) < oo 

(v) C{x,y) = E[{Xt - fJ-){Xt+x - n){Xt+y - /i)] for ah x, y, and t. 

In exploring the symmetries of the bivariate acf, C{x,y), we can use the substitutions 
t' = t — X and t' = t — y and the commutativity property of multiplication to deduce 

C(x, y) = C{-x, y-x) = C{y, x) = C{x - y, -y) = C{-y, x-y) = C{y - x, -x) (2) 

There is redundancy in this expression; for instance the relations C{x,y) = C{y,x) = 
C(— X, y — x) imply C(x, y) = C{y — x, — x). This begs two natural questions: 

— Which equations can be removed without losing any information? 

— Is there minimal set of equations representing all of the symmetries? 

We will answer both of these questions in the following sections, but first we return to 
the general case of A;*'^-order stationarity. 

The A:*^-order auto-cumulant function is defined to be the function of A; — 1 variables 
given by 

C{xi,...,xk-i)= Yl (-ir'(p-l)!^.i---/^., (3) 

{ui,...,Up) 

where the sum is over all partitions {vi, . . . , Up) of {0, xi, . . . , Xfe_i} and = E na;,6jyj 
For instance, for /c = 3 we have 

C7(xi,X2) = E{X^Xx,Xx,) - E(XoX,jE(X,J - E(XoX,,) E(X, J 

- nXx.X.,,) E(Xo) + 2E(Xo) E(X,J E(X,J (4) 
= E[(Xt - ^l){Xt+x, - f^){Xt+x, - Ai)] 

The last equality in (|4|) does not generalize for A; > 3, that is C is not the same as the 
more obvious generalization 

C(xi, . . . ,Xfc_i) = E [{Xt - n){Xt+xi - • • • {Xt+xk-i - fJ-)] 

For justification of the use of C as opposed to C, the reader is referred to section 7 
"moments or cumulants?" of 0] and section 2.3.8 "Why Cumulant Spectra and not 
Moment Spectra?" of llj. Nonetheless, we can see from ([3]) that C posses the same 
symmetries as C. The symmetries of the general cumulant function will be explored 
in subsequent sections. 

The k^^-ordev spectral density, f{uj), is defined as the continuous (resp. discrete) 
Fourier transform of the A:*^-order cumulant function, i.e. 
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The second-order spectral density has many uses; see section 1.10, "Time Series Anal 



ysis: Use of Spectral Analysis in Practice", of 13|]. Higher-order spectra are less 
frequently used, but still many applications exist — most notably is the use of the bis- 
pectrum in constructing linearity tests of time series Other applications of 

general higher-order spectra can be found in 11. [l^. 

Estimation of higher-order spectra is typically done with lag-window estimators of 
the form 

/(-) = 7^ E A(t/M)a(t)e-*- 

^ ' \\t\\<N 

where A is a lag-window function whose properties will be described below; M is a 
bandwidth or smoothing parameter; C is an estimate of C produced by estimating the 
expectations /Xj^- in ^ by sample means fi^^ . Indeed, estimation of A;*^-order spectra for 
large k is plagued by the curse of dimensionality, so accurate estimation of high-order 
spectra typically requires a large sample size. 

The lag- window function A can have varying restrictions, but three consistent as- 
sumptions are: 

(a) A(0) = 1; 

(b) ||A||l2 < oo; 

(c) A inherits the symmetries of C. 

It is easy to find functions to satisfy conditions (a) and (b). However, in order to 
satisfy condition (c), we must first determine the symmetries of C and then construct 
a function that possesses these same symmetries. For k = 2 the problem is trivial since 
the only symmetry condition on C{x) is C{x) = C{—x). We can easily impose evenness 
on A by constructing A = h{\{x),\{—x)) where h is any symmetric function (of two 
variables). For instance, if y) = x + y or /i(x, y) = xy^ then A will be forced to be 
even. The solution to the general problem will be addressed in the following sections. 



3 The Case /c = 3 

Equation ([2]) in the previous section exposes the six symmetries of C{x,y), and a 
general technique to arrive at these symmetries is addressed below. We start with 5*3, 
the set of permutations of three labels. Take for instance the permutation a £ S3 
that transposes the first two labels and fixes the third. This permutation is denoted 
as (T = (12)(3), or more simply, a = (12). All six permutations of 5*3 can be written 
down similarly; they constitute the set {e, (12), (23), (13), (123), (132)} where e is the 
identity permutation that doesn't move any labels. 

A methodical procedure is now presented that produces each of the symmetries 
of C{x,y) in ([2]). Under the commutativity property of multiplication, the product 
XtXt-\-xXt-\-y remains the same under any permutation of the three variables, so for 
each permutations in 5*3, we permute the variables, according to the permutation, then 
adjust t so that the first variable has index t. Then the symmetry condition can be 
read off from the last two variables. Say for example we take the permutation a = (12), 
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then we have 

E [XtXt+x^t+y] — > E [Xt+xXtXt+y] — > E [XtXt-xXt+y-x] (5) 

which corresponds to the condition C{x,y) = C{—x,y — x). This process can be 
simphfied shghtly by only writing down the indices and introducing the function ip : 
— > given by ip{a, b,c) {b — a,c — a). Then the example done in (0) is simplified 

to 

(0, X, y) (x, 0, y) {-x, y - x) 

Now suppose we take 2x2 matrix resulting from the coefficients of the x and y variables 
in each coordinate. This induces a mapping from each permutation into the set of 2 x 2 

matrices over M. For instance in the above example we would have the matrix 

corresponding to the permutation a = (12). This also establishes a correspondence 
between S3 and identities on C{x,y). Writing down this correspondence for each 
permutation in 5*3 gives 



(23) ^ r? n " C(y. " (=5 J) " - 



(6) 



We see that each of these matrices is invertible, so in fact there is a map, call it p, from 
the symmetric group S3 to the general linear group GL2(M), the group of invertible 
2x2 matrices over M. Amazingly, there is compatibility in the multiplication, namely 
if we take two permutations a and r and construct the composite permutation 7 = err, 
then in fact ^(7) = p{a)p{T). In other words p is a group homomorphism, and since 
it maps a group to the general linear group of some dimension, it is called a group 
representation. For example, if we take a = (12) and r = (13), then err, going from 
right to left, is equivalent to the permutation 7 = (132), and indeed 



.0 -1^ 

This is more than just a group homomorphism — it is an injective group homomorphism 
since each matrix in the image is distinct. In terms of representation theory, this is 
called a faithful group representation. This means there is no reduction of the group 
structure when viewing the group in terms of its image in the matrix group. A simple 
example of a non-faithful representation is the trivial representation that takes every 
element of the group to the number 1 (which belongs to GLi(M)); in this case all group 
structure is lost in the image of the representation. 

Matrices similar to those in ([6]) appear in the first chapters of most books on group 
representations; for instance one can find a similar set of the matrices on page 7 of 
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Diaconis' book [i"]. Yet the derivation of these matrices typicahy arise in a much 
different context that proceeds as fohows. Let V = {{a,b,c) £ M.^\a + b + c = 0} , a two 
dimensional subspace of M^, and consider the basis {hi = (1,-1,0), 62 = (1,0,-1)}. 
When the coordinates of 61 and 62 are permuted by a permutation in 53, the resulting 
vector still lies in V since the coordinates still sum to 0. The permuted basis vectors are 
rewritten as a linear combination of the original basis vectors, and the coefficients are 
extracted to form a 2 x 2 matrix. For example, a = (12) takes bi to —61 and 62 to 62 — 61 
giving the matrix correspondence for a as the one in ([6]). In fact this representation is 
identical to the one described earlier, even though the general procedures giving rise 
to the representations are completely different. 

We come back to the two questions posed in the previous section by identifying 
each identity in ([2]) with an element of 5*3 via the representation above. If we take the 
identities represented by the permutations a = (12) and r = (23), then we automati- 
cally have the identity corresponding to err = (123), i.e. given C{x,y) = C{—x,y — x) 
and C{x,y) = C{y,x), then C{x,y) = C{—y,x — y) follows automatically by apply- 
ing the first identity to the second. But more identities can be produced from just a 
and r. For instance, (13) = tut, so if we apply the identity C{x,y) = C{y,x) again 
to C{—y,x — y) we pick up C{x — y, —y), i.e. the identity corresponding to the per- 
mutation (13). Actually every identity can be produced from just the two identities 
corresponding to the permutations a and r. This is because these two transpositions 
generate the entire group 5*3. Therefore we can conclude that requiring 

C{x, y) = C{y, x) = C{-x, y - x) 

is equivalent to requiring the entire string of equalities in ([2]), and in general, any set 
of equations is sufficient as long as the corresponding permutations generate all of 5*3. 
So we have established a three-way correspondence between elements of ^3, matrices 
in GL2(M), and symmetries of C{x,y). 

There have been several attempts at constructing lag-window functions possessing 
the same symmetries as in ([2]), i.e. constructing a lag- window X{x,y) that satisfies 

X{x, y) = \{-x, y-x) = X{y, x) = X{x - y, -y) = X{-y, x - y) = X{y - x, -x). (7) 

For instance, the procedures in 17, 3, 10, [11,0] all produce bivariate lag-windows as 
constructed from univariate lag- windows. Specifically, given any even function A(x), 

X{x,y) = X{x)X{y)X{x-y) (8) 

will satisfy the requirements of ([7]). However this class of functions is too restrictive; 
we will see below that the popular Gabr-Rao optimal lag-window is excluded from 
this construction. A much different approach to constructing bivariate lag-windows is 



considered in 20|], but their focus is not on symmetrizing the lag- window function. 



The optimal lag-window, Aopt, introduced in 17[, is optimal in the sense that the 
variance of the bispectral estimators using this lag-window is least among a certain 
class of lag-windows; refer to Theorem 3 in the next section for generalization of this 
property. Aopt is defined by the inv6rs6 Fourier trctnsforni of the optimOjl kernel Aopt 
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given by 




UJ2+ UJiUJ2)^ , if Cjf + w| + UJ1UJ2 < '^'^ 

otherwise 



(9) 



This kernel, as with every kernel, is unique up to scale, i.e. the kernel Aopt(a;,y) is 
equivalent to the kernel hAopt(hx,hy) for any h > 0. Only an approximation of the 



We now ask the question: does Aopt(a;,y) admit a decomposition of the form dH])? 
It is immediately true that no such decomposition exists for a{x, y) since otherwise 
a(0, 0) = would imply A(0) = 0, but then 1) would be forced to be zero (when 
it's not). We extend the answer to Xo-ptix^y) with the following theorem. 

Theorem 1. Suppose X{x,y) is a continuous function that vanishes on a set of measure 
zero, and suppose A does vanish on some smooth nonlinear curve C C M'^. Additionally 
suppose A(0, 0) 7^ 0. Then there is no function f such that A(x, y) = f{x)f{y)f{x — y). 

Proof. Suppose there is a function / such that A(x,y) = f{x)f{y)f{x — y). If f{x) = 
on some interval /, then \{x,y) = for every G I x I contradicting the 

assumption that A vanishes on a set of measure zero. We will now show that /(x) 
is zero on some interval. Since A(0,0) = /(O)^ 7^ 0, we get that /(O) 7^ 0. Also, 
X{x,x) = /(x)^/(0) is continuous so /(x)^ is continuous. Let {x*,y*) be any point on 



C, then f{x*)f{y*)f{x* - y*) = 0, so at least one of /(x*), /(y*), f{x* - y*) must be 



zero. As [x* ,y*) moves continuously along C, the smooth and nonlinear properties of 
C guarantee that x*, y*, x* — y* each simultaneously sweep out some interval of points. 
Continuity of implies the continuity of |/(x*)|, |/(y*)|, and |/(x* — y*)|, but since 
their product is always zero on C, there must be some interval in which / is zero. 



Since Aopt is continuous, vanishes on a set of measure zero, and vanishes on the 
ellipse x^ — xy + = 2, we have the following corollary. 

Corollary 1. The optimal lag-window, Xopt, cannot be written as Xopt{x,y) = f{x)f{y)f{x— 
y) for any choice of function f. 




where 



a{x,y) = —=\lx'^ — xy + y"^ 



and J2 is the second-order Bessel function of the first kind, i.e. 




□ 
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Now we will construct a lag-window satisfying ([7]) from any bivariate function 
f{x,y). We have already seen a symmetrizing technique to create an even function 
in the previous section, and in taking a similar approach to this, we symmetrize f{x, y) 

by 

f{x, y) = h{f{x, y), f{-x, y-x), f{y, x), f{x - y, -y), f{-y, x-y),f{y- x, -x)) (10) 

where h is any symmetric function of its six arguments; for instance h could be a 
power mean like the arithmetic mean or geometric mean. The above construction can 
be made slightly more general by replacing f(x,y) in (jlOp with k{f{x,y)) where k 
is any (univariate) function. On the other hand if we take just the first coordinates 
in each / in the above formula for /, this also produces a function that satisfies the 
required symmetries. That is, if we let 

g{x,y) = h{g{x),g{-x),g{y),g{x - y),g{-y),g{y - x)) 

then g{x,y) is another symmetrization built up from a generic univariate function g, 
and if g is assumed to be an even function, then g{x,y) reduces to 

g{x, y) = h {g{x),g{y),g{x - y)) 

where h is any symmetric function of three variables. Therefore we automatically get 
the construction ([8]) with the special case /i(x, y, z) = xyz. 

To prove that these symmetrizations / and g do satisfy the required symmetries, 
we return to the connection with group representations and also introduce the concept 
of group actions. If we fix any permutation in 5*3 and multiply it with all the elements 
of S3, the resulting action just permutes the six elements of 6*3; this is a type of group 
action where S3 "acts on itself". To see that it is just a permutation of the group 
elements, suppose ar = a^, then multiplying both sides by gives r = 7. Suppose 
we wish to test a symmetry like f{x,y) = f{y,x). We see from ([6|) that this condition 
corresponds to the permutation (23), and multiplying (23) with each element of S3 just 
reorders the elements of S3, i.e. 

(23) • {e, (12), (23), (13), (123), (132)} = {(23), (132), e, (123), (13), (12)} (11) 

The righthand side corresponds to the equation 

f{y, x) = h{f{y, x),f{y - x, -x),f{x, y), f{-y, x - y), f{x - y, -y),f{-x, y - x)) 

and since h is a symmetric function, this is equivalent to the original equation f{x,y). 
In general, since each identity required just corresponds to a permutations of the ar- 
guments of h, each identity will be satisfied. To justify the symmetries of /, we note 
that taking the first coordinate corresponds to multiplying the matrix representation 
with the vector (1 0). So instead of identifying each permutation with the matrix in 
([6]) , it is identified by the 1x2 vector that is the top row of each matrix. Now if we 
wish to show g{x,y) = g{y,x), we see that this is equivalent to multiplying each 2x1 
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vector by the matrix . So again we see that this just induces a permutation of 

the elements, i.e. 




So we see the same theory as before applies as every identity is associated with a 
permutation the arguments of h leaving the value fixed. There is nothing special about 
the vector (1 0), and different vectors (or more generally n x 2 matrices) lead to similar 
formulas. For instance, using the vector (1 1), we see that the function 

9{x + y)g{x - 2y)g{y - 2x) 

also satisfies the required symmetries. 



4 The General Case 

Here we move from the symmetric group of degree three to the symmetric group of 
general degree k. It is well known that any permutation can be written as a product of 
cycles and every cycle is a product of transpositions (2-cycles), therefore the permuta- 
tions can be generated by just the transpositions. But more is true: every permutation 
is the product of transpositions of the form (i, i + 1) for i = 1, . . . , — 1. To see this, 
suppose i < j, then 

(z,j + i) = (j,i + i)(i,j)(i,i + i) 

So any transposition, after applying the above formula enough times, can be reduced to 
a product of transpositions of consecutive labels. Thus the group of k\ permutations 
can be represented by a much smaller subset of only k — 1 transpositions. 

Now a representation of is constructed for general k. We generalize the V' 
function to ^ : R*^ — > given by 

(Xi, X2, . . . ,Xk) ^ {X2 - Xi, . . . ,Xk - Xi) 

and the representation on is produced by composing the permutation with the 
if) function acted on (0,xi, . . . ,Xfe_i), then extracting the {k — l)-dimensional square 
matrix of coefficients from the image in the natural way. As an example, let a = 
(1234) G 54, then 

(0, X, y, z) {z, 0, X, y) {-z, x-z,y- z) 
So we have the correspondence 



(1234) < — ^ 
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This process clearly induces a map from Sk to {k — l)-dimensional square matrices, but 
it is not clear a priori that the matrices are invertible or if this operation establishes a 
group homomorphism. The following theorem proves this operation is indeed a group 
representation, and moreover, is a faithful group representation. 

Theorem 2. The map, referred to now as p, from Sk to {k — 1)- dimensional square 
matrices described above is a faithful group representation of dimension k — 1. 

Proof. Let be the kxk permutation matrix associated to a, i.e. the (i, j)-entry is 1 if 
a maps i to j and otherwise. It is well known that permutation matrices are invertible 
= -Per- 1), in fact the map a ^ P^- is an fc-dimensional group representation of 5^. 
Let A be the matrix with ones in the first column and zeros elsewhere. Then the above 
mapping can be described by associated the permutation a with Pg- then the operation 
of the ■0 function is represented by subtracting each row of Po- by the first row of Pa 
and then discarding the top row and leftmost column. Hence the representation p is 
described compactly by the map 

a^{Pa- APa) = {{I - A)Pa) (12) 

where the notation (B) denotes truncation of the top row and leftmost column of the 
matrix B. From basic block multiplication of matrices, we see that 

{Bi){B2) = {B1B2) (13) 

for any two matrices Bi and B2 the multiply compatibly. Also from block multiplica- 
tion, given any matrix i?, 

{B{I-A)) = {B) (14) 

as long as the multiplication is compatible. We first show that the image of a per- 
mutation under p is invertible, i.e. an element of GLfc_i(R), by exhibiting an inverse. 
Specifically, we show that the inverse of the matrix ((/ — A)Pcj) is ((/ — A)P^^'^. Ap- 
plying properties (fT3]l and (fTll) shows 

((/ - A)Pa) ((/ - A)P~^) = {I-A) {PM - A)) (P-1> 

= {I-A) {Pa){P-') 
= {I-A) 

= {!)■ 

In order to prove /) is a representation, it must be shown that for any a,T G Sk, 
p{a)p{T) = p{aT). This is equivalent to 

((/ - A)Pa) {{I - A)Pr) = {{I - A)PaPr) (15) 

By multiplying both sides on the right by (P^^) and making free use of p^ . equation 
(fTSll becomes 

{{I - A)PM - A)) = {{I - A)Pa) (16) 

(fTBl) is clearly true by noting {Pa{I — A)) = {Pa)- Therefore p is indeed a group 
representation. 
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The final task is to show p is faithful, i.e. p is injective. One easy way to prove this is 
to show that the only permutation being mapped to (I) is e, the identity permutation. 
In the language of group theory, this says that we need to show the kernel of p is the 
identity. So suppose ((/ — A)Pa-) = {I), then by multiplying both sides by {P^^) gives 
(/) = (P^^) = (Pfj-i). This says the permutation a^^ fixes the labels 2 through k, so 
it must also fix label 1. Hence = e = cr, and the only permutation in the kernel of 
p is e. 

□ 

So now that we have established p is a true group representation, we can extend 
the results from the previous section to general k. For instance if p{a) = (aij) for some 
a £ Sk, then the acf of A; — 1 variables satisfies the identity 




(17) 



Also, since the transpositions (12), (23), . . . , {k — 1, k) generate Sk, all of the symme- 
tries of C can be described by ()17p with just these permutations. If we consider the 
permutations that fix the first label, then we see that the identities in (jl7p induced by 
these (A; — 1)! permutations are just permutations of the arguments. Therefore C, in 
particular, is a symmetric function. 

Now with the results of Theorem 2, we can construct lag- windows functions that 
satisfy all of the required symmetries. 

Corollary 2. Given any function f : M.^^^ — > M, a symmetrization of f is constructed 
by starting with any symmetric function, h, of nl variables and plugging-in fa into the 
variables of h ( using every a £ Sk) where, like in [F, 



k-l 



fa{xi, . . .,Xk-l) = / 




5 The Fourier Transform 

We now come back to the motivation of this theory which is to construct higher-order 
spectral density estimates using lag- windows as in ([2]), i.e. estimates of the form 

/(-) = 7^ E A(t/M)a(t)e-*- 
l|t||<^ 

There is an equivalent expression to this estimator in the frequency domain given by 
f{u) = A* P{u) = [ A{t/M)P{uji -t)dt (18) 
where A is the Fourier transform of A and P is the {k — lY^ order periodogram; namely. 



A(t) = / X{t)e-''^-* dt 
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and 

P(,.A = 

(2vr) 



If we were to use the less practical estimator given in (llSp . then we would need a 
kernel A instead of a lag-window A, so an understanding of the symmetries of A is 
also important. To begin, we start by considering an example in the case A; = 3. The 
permutation a = (12) corresponds to the identity \{x, y) = X{—x,y — x), so we consider 
what the substitutions wi i-^ —uji and uj2 uj2 — uJi do to A; 



OO POO 



X{x, y)e-^^i(-^-2')e-^'^2(^) dx dy. 



— OO J — OO 
OO fOO 



— OO J — OO 



After the simplification in the second line, the actions of the Fourier transform begin 
to unfold, i.e. the exponential kernel of the Fourier transform induced the transposed 
representation x ^ —x — y and y ^ y. Now if we start with this transposed identity, 
i.e. LVi ^ —uji — LV2 and 0J2 ^ we find this produces a symmetry of A: 



00 fOO 



-00 J —00 
00 roo 



-00 J —00 

00 /"OO 



—00 J —00 



A(-x, y - x)e-*'^i^e-*'^2y 

= A(tJi,LJ2) 

Thus we have a pseudo-representation p which maps o" G S'fc to p(cr)', the transpose 
of /o(cr). For a,T £ S^, This representation satisfies 

p(o-t) = p(crT)' = (p(o-)p(t))' = p(r)V(a)' = p(r)p(o-) 

which is close to the requirement of a group representation, just reversed. However if 
we consider multiplication of two permutations in Sk in the opposite order, from left 
to right, then in fact p is a true group representation with the same properties as p. 
For example the multiplication (12)(13) from left to right becomes (123) and not (132) 
as in the right to left situation. Therefore we see that A possesses the "transposed 
symmetries" as those of A, i.e. if p{a) = (bij) for some a S Sk, then A satisfies 

(n— 1 n—1 \ 

^Ij'^i ' • • • ' ^n-l,j^^j ■ (19) 

In particular, like in the case of the lag-window A, A is a symmetric function. 
As an example, the symmetries for k = 3 are given by 

A(x, y) = A{-x -y,y)= A(y, x) = A(x, -x - y) = A(y, -x - y) = K{-x - y, y) 
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A construction of the bivariate kernels from univariate kernels similar to ([8]) is easily 
derived using the theory described to become A(x, y) = A(x)A(y)A(— x — y) where the 
only requirement on the univariate A(x) function is nonnegativity. 

We shall now apply this theory to generalize the Gabr-Rao optimal window, Aopt, 
for estimation of the trispectrum and general A;*'^-order spectra. The window must 
integrate to one and satisfy (fT9l) for every a £ Sk- Observing that the usual Gabr-Rao 
window is a constant plus a homogeneous polynomial of degree 2 (inside its support), 
we start with the construction 

Aopt(wi, . . .,LOk-l) = C + '^CijLJiLJj (20) 

i<j 

for some constants c and Cij. Since Aopt is a symmetric function, equation (pOj) can be 
greatly simplified to 

Aopt(wi, . . . ,a;fc-i) = c + ci'^ivf + C2 ^ (21) 

1=1 i<j 

Here we see the convenience in associating the symmetries of A to S^- We showed 
earlier that it is not necessary to test all of the possible identities of A, just the ones 
that correspond to the permutations that generate S^- Since equation ([2T]) accounts 
for the symmetries corresponding to the permutations that fix 1, all that is left is to 
check the identity corresponding to the permutation (12), i.e. if Aopt satisfies 

Aopt(wi, . . . ,LJk-i) = Aopt I - ^t^j , "^2 , • • • , 1 (22) 

then Aopt satisfies all of the necessary symmetries. Applying ([22]) to (i2T]) gives 

fc-1 

Aopt(a;i, . . . , uJk-i) = c + ci ^ + C2 ^ WjWj (23a) 

i=l i<j 
fc-1 fc-1 

= c + ciiol + C2UJ1 ^ Uj + ci ^ uj'^ + C2 ^ Ldiitij (23b) 

j=2 i=l l<i<j 

(fc-1 \ 2 /fc-1 \ fc-1 fc-1 

^ J - C2 i^OJij^UJj + Cl^Ujf + C2 ^ UJiUJj (23c) 

i=l / \i=l / j=2 i=l l<i<j 

Comparing (I23bp with (I23cp gives the identity 

fc-1 /fc-1 \ 2 /fc-1 \ fc-1 

ci^i + C2u;i ^ujj = ci l^uJij - C2 ^ 1 ^ ojj (24) 

j=2 \i=l J \i=l J j=2 



There are many different ways to massage (|24p into conditions on ci and C2. Here we 
differentiate both sides of ()24p with respect to wi producing 



fc— 1 n k—1 

i=2 1=1 i=2 



2ciu;i + C2 ^ = 2ci ^ - C2 ^ I 
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which is equivalent to 

k-l k-l 

2ci '^uji = 2c2 ^ uji 

i=2 i=2 

Therefore we must have ci = C2, and it is easily seen that (j24p is satisfied under this 
condition. Thus 

Aopt(wi, • • • ,Wfc-i) = a 1 - /? ^u,^ + ^WjWj j j (25) 

\ \i=l i<j 

satisfies all of the necessary symmetries. We now define Aopt(^i') as 
. / ^ T. ^4- fA(cj), if A(a;) > 

AoptM=A(a;)+= ^ ^' ^ ^. " (26) 
I (J, otherwise 

where /3 is any positive constant (kernels are unique up to scale) and a is chosen such 
that Aopt integrates to one. 

Theorem 3. Let A(aj) be any nonnegative kernel that integrates to one and satisfies 
all the necessary symmetries, i.e. satisfies ([T9|) for all a G Sk- Also assume 



u?^K{u)du= j io'j Aopt{i^) duj (27) 



-1 



for j = 1, . . . ,n - 1. Then \\A\\l2 > ||Aopt||L2' ^-e- 

/ A{ufduj> / Aopt(^)^ do;. 
Proof. Let A(a;) = Aopt(^) + then plugging this substitution into ([27j) gives 

/ Lu'^A{u:)du;= / u;"^ {Aopt{u:) + e{u:)) du = / tj^Aopt(^) (icj 



Therefore 



/ e(c^) (ia; = (28) 

for all j. Again making use of the identity in (|22p gives 



/ ujf £{u:) duj = \ / '^j ] ^{^) 

/ a;^ e(a;) du + 2 \^ / a;,- £{uj) du 

2 > / (jj LiJj e(a;) do; 
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Therefore 

V / uji LOj e{u:) du = 0. (29) 

Since A and Aopt both integrate to one, e must integrate to zero, i.e. 

/ e(^) duj = 0. (30) 
Computing the L2-norm of A(ti;) gives 

/ A{u;fduj= / Aopt(t^)^ (i^ + / e{ujfduj + 2 / Aopt{i^)e{u) du 

jRk-l jRk-l jRk-l jRfe-l 

So if we can show that last summand is nonnegative, we wih have proved the theorem. 
Referring to the definition of A(a^) in (j25p . we have 

/ Aoptii^)e{u) du = / Aopt(t^)e(<^) - / Aopt(t^)e(<^) (31) 

a;:Aopt(<^)<0 

The first integral is zero by ([28]), and ([30]). When A(cj) < 0, Aopt(<^) = and 

A(u;) > 0, so we must have e(cj) > 0. Therefore the second integral is less than zero 
making the left hand side of (jSip nonnegative. □ 

For k = 2, if we let /3 = 1/5, then a = and ()26p is the familiar Epanech- 
nikov kernel @|. This kernel is equivalent to the Bartlett-Priestly kernel in [l^ and 
corresponds to the quadratic spectral lag- window in [l[. For k = 3, (3 = l/vr^, and 
a = \^/7r'^, (j26p is the Gabr-Rao optimum bispectral kernel 17]. 

The set of points that satisfies Aopt('^) > is equivalent to 

fc-i ^ 

i=l i<j ^ 

where the left hand side is a positive quadratic form since 

fc-i ^ k-l ^ /k~i \ 2 

Y^u;^^ +Y^u^uj, = -Y^^ + l^iY^i] >0 (32) 

1=1 i<j 1=1 \i=l / 

and equal to zero only if u)j = for all j. Therefore there always exists some trans- 
formation of coordinates to transform the ellipsoid defined by Aopt(^) > to the unit 
sphere, i.e. a diagonalization of the quadratic form (j32p to standard form with rank 
and signature k — 1. This transformation is easily computed for any given k [3], and 
the Jacobian of the transformation can be used to determine a for a given (5 and k. 
For instance for A; = 4, the substitutions 



^^3 > \l 
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transforms + c<j| + w| + UI1UI2 + + 1^11^3 into + n| + The Jacobian of this 
transformation is \/2, and so a is computed to be 



a 



15^ 

8V2TT ' 



Polar coordinates on M*"' ^ can be used to exactly determine a for larger k 

6 Bivariate Examples 

The bivariate optimal kernel with corresponding lag-window are plotted below. 



Aiirt ,tl2 





Figure 1: Plots of Aopt(c^i, c^2) and Xopt{x,y). 

The kernel Aopt(wi,a;2) can be derived from (fTOj) (unlike dS]) as proved by Theorem 
1) by symmetrizing f{x,y) = \ — — with the symmetric function h being the 
arithmetic mean. Since the symmetrized / will be a quadratic form satisfying the 
appropriate symmetries, it must be of the form (j25p for some a and /3. 



Now let f{x,y) = (1 - 



y 



2^+ 



max(l — — y^,0). We produce three new 



lag-windows from the symmetric functions h = Y\xi, h = max(xi), and h = min(xj). 




Figure 2: / with h = Ylxi, h = max(xj), and h = min(xj). 
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We conclude with two flat-top (infinite-order) lag-windows since estimation with 
these windows are asymptotically superior The first example is a right pyramidal 
frustum with the hexagonal base \x\ + \y\ + \x — y\ = 2. We let c G (0, 1) be a scaling 
parameter that dictates when the frustum becomes flat, that is, the flat-top boundary 
is given by |x| + |?/| -|- |x — y| = 2c. The equation of this lag-window is given by 



1 



1 



Arp(n,''"2) 



1 



-A, 



n T2 



C V C C 



where Ap is the equation of the right pyramid with base \x\ + \y\ + \x — y\ =2, i.e., 



Arp(2;,y) 



(1 - max(|x|, |y|))" 



T < X, y < or < x, y < 1 



(1 — max(|x -|- 2/1, |x — yl))"'', otherwise 



The second example is the right conical frustum with elliptical base x"^ — xy + y'^ = 1. 
Again c G (0, 1) is a scaling parameter, and the lag-window becomes flat in the ellipse 
x"^ — xy + y'^ = (? . The equation of this lag- window is given by 



Arcf(n,T2) 



1 



1 



-Arc(Ti,r2) 



Arc I — , — 

1 — c V c c 



where A^ is the equation of the right cone with base — xy -|- = 1, i.e., 



Arc(a;,y) = (1 - \/ 



x^ 



xy + y^ 



These two examples are plotted below with c = 1/2 in each case. 




Figure 3: Lag-windows Arpf and Arpf. 



7 Conclusions 



This paper elicits a deep connection across permutations of Sk, symmetries of lag- 
windows and kernels, and invertible matrices over M. The structure provided in group 
theory gives a clear understanding to many of the basic elements of higher-order spec- 
tral analysis. In particular, the symmetries of the auto-cumulant function are now well 
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understood, a general prescription for producing multivariate lag-windows and kernels 
is given, and the Gabr-Rao optimal kernel is easily generalized with the aid of basic 
properties of the symmetric group. 
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